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Abstract 

The SISCone jet algorithm is a seedless infrared-safe cone jet algorithm. There exists an im- 
plementation which is highly optimised for a large number of final state particles. However, 
in fixed-order perturbative calculations with a small number of final state particles, it turns 
out that the computer time needed for the jet clustering of this implementation is comparable 
to the computer time of the matrix elements. This article reports on an implementation of 
the SISCone algorithm optimised for low particle multiplicities. 



PROGRAM SUMMARY 

Title of program: siscone_parton 
Version: 1.0.1 
Catalogue number: 

Program obtained from: http : / /wwwthep .physik .uni-mainz . de/~stefanw/ software .html 

E-mail: stef anw@thep .physik . uni-mainz . de 

License: GNU Public License 

Computers: all 

Operating system: all 

Program language: C++ 

Memory required to execute: Negligible for low parton multiplicities. 
Other programs called: none 
External files needed: none 
Keywords: Jet algorithms. 

Nature of the physical problem: Clustering of particles into jets. 
Method of solution: Infrared-safe cone algorithm. 

Restrictions on complexity of the problem: Hard-coded restriction to 64 final state particles on 
64-bit machines, but recommended to be used only for configurations with up to 10 final state 
particles. 

Typical running time: Depending on the number of final state particles, (/lis) for configurations 
up to 5 final state particles. 
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1 Introduction 



The clustering of particles into hadronic jets is relevant to all current high-energy collider experi- 
ments. The clustering procedure is described by a jet algorithm. The available jet algorithms can 
be divided into two classes: Sequential recombination algorithms and cone-type algorithms. For 
a comparison between experiment and theory it is essential that a jet algorithm is infrared- safe. 
Infrared-safe observables can be calculated reliably within perturbation theory. The cone-type 
algorithms have suffered for a long time from not being infrared-safe. This problem has been 
solved with the invention of the SISCone jet algorithm, which is a seedless infrared-safe cone jet 
algorithm [1-3]. 

For a correct comparison between experiment and theory one has to use exactly the same 
jet algorithm in both cases. However, the situations in which the jet algorithm needs to be 
used are quite different: Experimentalists need to cluster of the order of one hundred particles, 
while in higher-order calculations within fixed-order perturbation theory one usually deals only 
with a few partons in the range from two to five or six. The original proposal of the SISCone 
algorithm in [2] described a method, which takes o(n-2 n ) time to cluster n particles. This 
is impractical in the experimental case. A major breakthrough was therefore the invention of 
a method, which performs this task in o(n 2 lnn) time [3]. In addition, the authors of ref. [3] 
clarified and fixed several ambiguities related to residual infrared unsafety. The implementation 
by the authors of ref. [3] has become the reference implementation of the SISCone algorithm. 
This method together with the corresponding implementation was mainly responsible for the 
fact, that experimentalists nowadays use frequently the infrared-safe SISCone jet algorithm. 

Let us now turn to the theoretical side: Next-to-leading order (NLO) or next-to-next-to- 
leading order (NNLO) calculations are usually done with the help of the subtraction method. 
This introduces a collection of subtraction terms for the real emission part. Each subtraction 
term has its own configuration of parton momenta, in general different from the other ones and 
different from the configuration of the matrix element it approximates. For jet observables this 
implies that the jet algorithm has to be run once for each subtraction term. For sequential re- 
combination algorithms like the k±- or anti-fc^-algorithms [4-8] this is not a problem, since the 
CPU time for the jet algorithm is negligible against the CPU time required for the matrix element 
associated with the subtraction term. However, in a recent comparison of different jet algorithms 
at NNLO in electron-positron annihilation [9] it turned out that the CPU time of the reference 
implementation of the SISCone algorithm was at the order of the CPU time of the matrix ele- 
ments. This slowed down the computation of the NNLO prediction for the SISCone jet rates 
significantly compared to the NNLO predictions for the jet rates with other jet algorithms, in 
particular when results for different cone sizes were calculated in parallel. (In ref. [9] the results 
for 100 different cone sizes where calculated in parallel.) To fix this problem, a dedicated imple- 
mentation of the SISCone algorithm optimised for low particle multiplicities has been written. 
These routines can be of use for any calculation within fixed-order perturbation theory using the 
SISCone algorithm. 

In this article I report on the implementation of the SISCone algorithm optimised for low 
particle multiplicities. The implementation includes a version for hadronic collisions as well as 
a version for electron-positron annihilation. The implementation is written in C++. Wherever 
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possible I kept the same syntax as the reference implementation of [3]. Efficiency issues require 
a few modifications of the syntax, which are all documented in this article. 

It should be clearly stated the implementation of this article is targeted for the case of a 
few final state particles. In this case the implementation of this article can give a significant 
performance boost, up to a factor 70 for two-particle configurations. For final states where the 
number of particles is at the order of one hundred, the implementation of [3] should be used. The 
cross-over occurs around 10 final state particles. 

This paper is organised as follows: In the next section I review the SISCone jet algorithm. 
Section 3 describes the optimisation techniques used in the low parton multiplicity case. Sec- 
tion 4 gives detailed information on how to use the program. The performance is reported in 
section 5. Finally, section 6 contains a summary and the conclusions. 

2 The SISCone jet algorithm 

In this section I review the definition of the SISCone jet algorithm [3]. There are two variants of 
the SISCone jet algorithm, one variant is adapted to hadronic collisions, while the other variant 
is adapted to electron-positron collisions. The hadronic version uses the rapidity y and the az- 
imuthal angle (]) as the basic variables and corresponds to a cylindrical geometry. The version for 
electron-positron annihilation uses instead the angle between particles as the basic variable and 
corresponds to a spherical geometry. I will first describe the common features of both versions 
and give the specific features for each version in the end. 

The SISCone jet algorithm depends on the four parameters R, f, n pass and v m i n . The most 
important parameters are the cone size R and the overlap parameter /. The parameter n pass 
specifies how often the procedure for finding stable cones is maximally iterated. This parameter 
can be set to infinity. The parameter v m i n defines the minimal value of the quantity v t hreshold 
(to be defined below) required for a jet. In addition to these four numbers we have to specify 
a distance measure R(i,J) between a particle i and a cone axis corresponding to the protojet J 
and three functions v t hreshoid(^)> v or d e ring(^) and v ove riap(^) related to the split merge procedure. 
The two versions for hadronic collisions and electron-positron annihilation will only differ in 
the definition of the distance measure and in the definition of the functions v t hreshoid> v or d er ing and 

^overlap- 

The SISCone jet algorithm is specified as follows: 

1. Put the set of current particles equal to the set of all particles in the event and set i pass = 0. 

2. For the current set of particles find all stable cones with cone size R. 

3. Each stable cone is added to the list of protojets. 

4. Remove all particles that are in stable cones from the list of current particles and increment 

l pass- 

5. If i pass < ipass and some new stable cones have been found in this pass, go back to step 2. 
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6. Run the split-merge procedure with overlap parameter / and threshold v, 



A set of particles defines a cone axis, which is given as the sum of the momenta of all particles 
in the set. A cone is called stable for the cone size R, if all particles defining the cone axis have 
a distance measure smaller than R to the cone axis and if all particles not belonging to the cone 
have a distance measure larger than R with respect to the cone axis. The four-momentum of a 
protojet is the sum of the four-momenta of the particles in the protojet. This corresponds to the 
E-scheme. Two protojets are called overlapping, if they share at least one particle. 

Let us now turn to the split-merge procedure: The split-merge procedure depends on the two 
parameters v m i n and /, and on the three functions v threshold* v or( iering and v ove riap- The split- merge 
procedure discards all protojets with v t hreshoid < v m i n , therefore v m i n is a threshold value a jet 
must have. In addition, the split-merge procedure requires an infrared-safe ordering variable 
for the protojets. This variable is denoted v or dering- Furthermore, if two protojets overlap, we 
need a quantity which describes the overlap. This quantity is denoted v ove riap- The split-merge 
procedure is defined as follows: 

1. Remove all protojets with v t hr es hold < v m in- 

2. Find the protojet / with the highest value of v or dering- 

3. Among the remaining protojets find the one (/) with highest value v or dering that overlaps 
with J. 

4. If there is such an overlapping jet then compute the quantities v 0V eriap(^n/) and v ove riap(7). 

(a) If Voveriap (^H 7) < /v 0V eriap(7) assign each particle that is shared between the two pro- 
tojets to the protojet whose axis is closest. Recalculate the momenta of the protojets. 

(b) If Voveriap (JH J) > /v ove riap(7) merge the two protojets into a single new protojet and 
remove the two original ones. 

5. Otherwise, if no overlapping jet exists, then add / to the list of jets and remove it from the 
list of protojets. 

6. As long as there are protojets left, go back to step 1. 

It remains to specify the distance measure R(i,J) and the three functions v t h re shoid(^)> v or dering(^) 
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and v verlap(^)- These quantities differ in the two versions of the SISCone algorithm. We have 
cylindrical spherical 



R(i,J) 



di,j = \/(yi-yj) 2 + (h-fo) 2 = 



arccos 



Pi ■ PJ 
\Pj\ 



Vthreshold(^) : P±(J) = 



^ordering (J) 
v overlap (-0 : 



keJ 

P±(J) = £ \P±,k\ 
k<EJ 

P±(J) = £ \P±M\ 

keJ 



E(J) = "£E k 



keJ 



i- |2 

PA 



E(J) = J £E k [ l + ^-sin 2 M 



keJ 

E(J) = "£E k 

keJ 



In this table the indices i and k refer to a particle, while the index J refers to the four-vector of 
a protojet. In the cylindrical version (which is appropriate for hadronic collisions) the distance 
measure is calculated from the rapidity y and the azimuthal angle (]). The threshold function 
v threshold (J) is given as the absolute value of the sum of the transverse momenta p^ k of the 
particles k making up the protojet J. The ordering function v or d er i ng (/) and the overlap function 
Voveriap(^) are given as the sum of the absolute values of the transverse momenta p± k of the 
particles k making up the protojet J. Since the main application of this implementation is within 
fixed-order perturbation theory infrared-safeness is essential. Therefore choices of the ordering 
function which are not infrared-safe are not considered. The choice for v or( iering (J) given above 
is the recommended one of ref. [3]. 

In the spherical version (which is appropriate for electron-positron annihilation) the distance 
measure is given by the angle between the three-momentum pi of particle i and the three- 
momentum pj of the protojet axis of the protojet J. The threshold function v t hreshoid(^) and 
the overlap function v 0V eriap(^) are given by the sum of the energies of the particles k making up 
the protojet J. The ordering function v or dering(^) is given by the quantity 



E(J) = 1 + 

keJ V 



PA sin 2 6w 



E 2 



(1) 



Note that this is the function, which is actually implemented in the code of [3], and not the 
function 



£^(l+sin 2 M ) 

keJ 



(2) 



3 The implementation for low particle multiplicities 

The standard implementation [3] of the SISCone algorithm is highly optimised for the experi- 
mental situation, where one faces the challenge to cluster of the order one hundred particles. In 
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this situation the asymptotic run-time behaviour of the algorithm is very important. The asymp- 
totic run-time behaviour is dominated by the time needed to find all stable cones and is O (n 2 \nn) 
for n final state particles in the implementation of [3]. The CPU time of the parts of the algorithm 
which do not grow so fast are less important if one deals with the order of one hundred particles. 

However, the situation is different if only a few final state particles need to be clustered. In 
this situation the asymptotic run-time behaviour is less important, and all parts of the algorithm 
have to be taken into account for the optimisation. It turns out that it is most efficient to use for 
the low particle multiplicity case a method which reduces significantly the computational cost of 
the parts which would be sub-dominant in the asymptotic regime. The implementation of this 
paper uses the method of [2] to find all stable cones. This method grows like O (n ■ 2") for n final 
state particles. This is acceptable since the intention is to use this implementation only in the 
case of small values of n. 

The implementation is written in C++. There are a few standard optimisation techniques 
which are used to make the code fast: First of all the intention is that this implementation is 
used within theoretical fixed-order calculations. The authors of fixed-order numerical programs 
usually have their own implementation of a four-vector class, which stores the momenta of the 
final state particles. Copying the data to a similar class which is used by the jet algorithm can 
take a significant amount of CPU time. This can be avoided by making the implementation of the 
SISCone jet algorithm a template. The implementation of this paper works with any four- vector 
class, under the condition that the four-vector class provides a few standard methods. A simple 
four-vector class which has all the required methods comes with the implementation. 

The input to the jet algorithm is a list (actually a std : : vector) of n four- vectors, represent- 
ing the n final-state particles. After clustering has been done, the content of a jet is represented 
by an unsigned integer. Within this unsigned integer, the y'-th bit corresponds to particle j. A 
value of 1 of the corresponding bit indicates that the particle belongs to the jet, while a value of 
means that the particle does not belong to the jet. In this way one can represent up to 64 particles 
in an unsigned integer on 64-bit machines (and up to 32 particles in an unsigned integer on 32-bit 
machines). However, these numbers should be taken as a hard-coded upper limit, beyond which 
it is not possible to use this implementation. In practise one would like to switch already earlier 
on to the implementation of [3] with the better asymptotic run-time behaviour. The advantage of 
using unsigned integers to represent jets is given by the fact that many operations (like merging) 
can be carried out through fast bit-operations. 

The implementation of the spherical version of the SISCone jet algorithm uses internally the 
distance measure 



yi,j = l-cos8 ir/ (3) 

instead of 0,,y. y ; j is a monotonic function of ; y in the interval [0,7t]. Using y ; y has the 
advantage that the distance can be calculated without any call to a trigonometric function. The 
value of y u is calculated as 

yi,j = l--prr--prT- ( 4 ) 
Pi PJ 
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This requires only a call to the sqrt -function for the normalisation of the spatial three- vectors p. 
All normalised spatial three- vectors are pre-computed at the start of the jet algorithm and stored 
in an array. 

In the cylindrical case the distance measure involves the rapidity y and the azimuthal angle 
(]). This involves a call to the log-function and a call to the at an 2 -function. These calls are 
rather expensive and lead to the effect that the cylindrical version requires more computer time. 
In order to avoid some of these function calls, the following optimisation is implemented in the 
program: At the start of the jet algorithm, all azimuthal angles are pre-computed and stored in 
an array, but the rapidities are not yet computed. In the search for stable cones some cones can 
immediately be discarded based on the information of the azimuthal angle alone. These are the 
cones J, which have a particle i belonging to the cone J, whose distance in azimuthal angle is 
already larger than the cone size R: 

djj = (<bi-<bjf>R 2 . (5) 

These cones are never stable, since adding the rapidity distance would only increase the distance 
measure. For those cones the calculation of the rapidity can be avoided. 

A second optimisation technique can be applied to events, which have a total p^-sum of zero. 
This is the case for pure QCD events, but not for mixed QCD-electroweak events like W +jets. 
If the total p^-sum is zero, only half of the values of the azimuthal angle (and of the transverse 
momenta) need to be computed. The other half can be obtained from transverse momentum con- 
servation without a call to the at an 2 -function. In the implementation this additional optimisation 
can be turned on by setting a flag that the total p^-sum is zero. 

4 How to use the program 

The implementation of the SISCone jet algorithm is written in C++ and can be obtained from 

http : / /wwwthep .physik . uni-mainz . de/~stefanw/ software .html 
After unpacking, the directory will contain the four files 

siscone_parton .h fourvector.h example. cc example_spherical . cc 

The file siscone_parton .h contains the implementation of the SISCone jet algorithm, defined 
within the namespace siscone_jet_algorithm. The SISCone jet algorithm is implemented 
as a template class, therefore all code is contained in the header file siscone_parton . h. No 
compilation or installation is required, the implementation can be used by including the header 
file in the source code of the user. The version of the SISCone jet algorithm which is based on 
the cylindrical geometry and which is appropriate for hadronic collisions is implemented in 

template<class T> class siscone_parton 

The version based on the spherical geometry, which is appropriate for electron-positron annihi- 
lation is implemented in 
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template<class T> class siscone_spherical_parton 

In both cases the template argument is a class implementing four-vectors and denoted by T in 
the declarations above. The four-vector class must have a few standard methods, which are 
summarised in the following fragment of an example header file: 

class fourvector { 



public : 
fourvector (void) ; 



fourvector & sum_up (const fourvector & q) ; 
fourvector & rescale (double c) ; 



double transverse_momentum (void) const; 
double rapidity (void) const; 
double azimuthal_angle (void) const; 



double spatial_norm (void) const; 

double spatial_scalar_product (const fourvector & q) const; 
double energy_component (void) const; 

}; 

In detail, the four-vector class has to provide a default constructor, which initialises the four 
components of the four- vector with zero. The method sum_up (q) adds the four- vector q to the 
current four- vector. The method rescale (c) multiplies all components of the four-vector by 
the factor c. The default constructor and the two methods sum_up and rescale are required by 
both versions of the SISCone jet algorithm. 

The class siscone_parton which uses the cylindrical geometry requires in addition the 
following three methods: A method transverse_momentum ( ) which returns the transverse 
momentum with respect to the beam axis. With the beam-axis along the z-axis the transverse 
momentum is given by 

P± = \jp\+Py, (6) 

where p x and p y denotes the x- and the y-component of the four- vector p, respectively. Secondly, 
a method rapidity ( ) which returns the rapidity of the four-momentum. With the beam along 
the z-axis, the rapidity is given by 

y = ih.»±* (7) 
2 Pt-Pz 

where p t denotes the energy component and p z denotes the z-component of the four- vector p. 
Finally, a method azimuthal_angle ( ) which returns the azimuthal angle (|) of the four-vector 
with respect to the beam axis. This angle obeys the relation 

tan<|> = ^, (8) 

Px 
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if the beam-axis is again along the z-axis. The three methods transverse_momentum, rapidity 
and azimuthal_angle are not needed if just the spherical version of the SISCone jet algorithm 
is used. 

The class siscone_spherical_parton which uses the spherical geometry requires in ad- 
dition the following three methods: A method spatial_norm ( ) which returns the norm of the 
three spatial components of the four- vector: 

\P\ 2 = Pl+Py+Pl (9) 

Secondly, a method spatial_scalar_product (q) which returns the scalar product of the spa- 
tial components with the ones of the four-vector q: 

p-q = p x q x +p y qy+Pz<iz- (10) 

Finally, a method energy_component () , which returns the energy component p t of the four- 
vector. The three methods spatial_norm, spatial_scalar_product and energy_component 
are not needed if just the cylindrical version of the SISCone jet algorithm is used. 

The file fourvector .h gives an example of a four-vector class which has all the required 
methods. Since the required methods are rather simple, they can be implemented as inline 
functions. Therefore all code relevant to the four-vector class is contained in the header file 
fourvector . h. 

The SISCone jet algorithm based on the cylindrical geometry is initialised by 

siscone_parton<f ourvector> jet_f inder (n_max) ; 

The template argument fourvector should be replaced by the name of the appropriate four- 
vector class. The variable n_max gives the maximal number of final state particles. In a fixed- 
order perturbative calculation n_max is usually set by the number of final state particles in the 
real emission contribution. The constructor allocates the memory which is needed for the jet 
algorithm. Since the CPU time for the allocation and deallocation of memory is not negligible, 
it is recommended that within a Monte Carlo integration the constructor is called exactly once. 
The spherical version of the SISCone jet algorithm is correspondingly initialised by 

siscone_spherical_parton<f ourvector> jet_f inder_spherical (n_max) ; 

In both versions the jet algorithm is invoked by a call to the method 

int compute_jets (const std : : vector<f ourvector> & particles, double R, 
double f, int n_pass_max=0, double v_min=0.0); 

The input parameters of the method are: a list of four-vectors of the final state particles, given 
as a std: : vector<f ourvector>. The input parameters R and f denote the cone size and the 
overlap parameter, respectively. The maximum number of passes is given by n_pass_max. A 
zero value for n_pass_max is treated as infinity. This is the default choice for n_pass_max. The 
minimal value of the threshold parameter is given by v_min. In the cylindrical version this is 
the p^-value, in the spherical version it is the energy E. The default choice for this parameter is 
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zero. The name and the arguments of the method compute_jets are identical to the ones in the 
implementation of [3]. This should make it easy to switch between the two implementations and 
to use each implementation where it performs best. 

The method compute_jets returns the number of jets nj e t. In addition, the content of the jets 
can be found in the data member 

template<class T> class siscone_parton 
{ 

public : 

std: : vector<unsigned> final_jets; 

}; 

The class siscone_spherical_parton has also the data member f inal_jets. After a call to 
compute_jets this vector contains nj et elements. Each entry is an unsigned integer. If the j-th 
bit of entry i is set, it implies that particle j belongs to jet i. 
The class siscone_parton has in addition a method 

public : 

void set_f lag_total_pt_is_zero (bool flag) ; 

For events with a total p^-sum of zero, this flag can be set to true. The jet algorithm will then 
use special optimisations, which are valid if the total p^-sum is zero. This can be used for pure 
QCD events, but should not be used for mixed QCD+electroweak events like W + jets-events. 
The default choice is false. 

The file example . cc contains a simple example program, which shows how to use the jet 
algorithm. The listing of the test program is as follows: 

♦include <iostream> 
♦include <vector> 

♦include "fourvector .h" 
♦include "siscone_parton .h" 

int main() 
{ 

using namespace siscone_jet_algorithm; 

// intialise the SISCone algorithm for maximally 4 final state particles 
int n_max = 4; 

siscone_parton<f ourvector> jet_f inder (n_max) ; 

// define the parameters for the SISCone algorithm 
double R = 0.9; 

double f = 0.5; 

int n_pass = 0; 
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double pt_min = 5.0; 

// some final-state momenta 

std: : vector<f ourvector> particles (4) ; 

particles[0] = fourvector (1 . 87116, -1 . 08275, 1 . 33996, -0 . 730343) ; 
particles [1] = fourvector (33 . 3795, 1 . 00171, 28 . 1084, -17 . 9751) ; 
particles[2] = fourvector (32 . 4375, -12 . 8963, -13 . 154 , 2 6 . 6992 ) ; 
particles[3] = fourvector (22 . 3118, 12 . 9774 , -16 . 2 944 , -7 . 99375) ; 

// call the jet algorithm 

int n_jet = jet_f inder . compute_jets ( particles, R, f, n_pass, pt_min ); 
/ / output 

std::cout << "Number of jets = " << n_jet << std: :endl; 
for (int j=0; j<n_jet; j++) { 

std::cout << " Particle content of jet " << j << " : "; 

for (int i=0; i<n_max; i++) { 

if ( jet_f inder. final_jets [ j] & (l«i) ) std::cout << i << " "; 

} 

std::cout << std: :endl; 

} 

return 0; 

} 

The program initialises first the jet algorithm for maximally four final state particles. It then 
defines the parameters for the jet algorithm and a set of four final state momenta. The call to 
the method compute_jets invokes the jet algorithm. After the jets have been computed, the 
program outputs the number of jets as well as the particle content of each jet. The program can 
be compiled with the command 

g++ -o example example. cc 

Running the program will produce the output 

Number of jets = 3 

Particle content of jet 0:01 

Particle content of jet 1 : 3 

Particle content of jet 2 : 2 

We see that three jets have been found. Particles and 1 have been clustered together to form jet 
0. Jet 1 consists of the particle 3, while jet 2 consists of particle 2. 

The file example_spherical . cc is very similar to the file example . cc, but uses the spheri- 
cal version of the SISCone jet algorithm instead. The example for the spherical version uses the 
same set of momenta of the final state particles. Running the program example_spherical will 
produce the output 
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Performance SISCone (cylindrical version) 
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Figure 1 : Speed-up factor as a function of the number n of final state particles for the cylindrical 
version of the SISCone jet algorithm. Shown is the ratio of the CPU time of the implementation 
of [3] by the CPU time of this implementation. The blue line shows a ratio of 1 . The cross-over 
occurs between 9 and 10 final state particles. 
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5 Checks and performance 

In this section I compare the implementation of the SISCone jet algorithm of this article with 
the implementation of [3] as a function of the number n of final state particles. The number n 
of final state particles ranges in the comparison between 2 and 1 1 . For each n I generate O ( 10 6 ) 
random events and I cluster the final state particles with the two implementations of the SISCone 
jet algorithm. The two implementations find exactly the same number of jets for all events. Let 
us first consider in more detail the clustering based on the cylindrical geometry appropriate to 
hadronic collisions. Fig. 1 shows the speed-up factor as a function of the number n of final state 
particles. In this figure the ratio of the CPU time of the implementation of [3] by the CPU time 
of this implementation is shown. The absolute timings of the two implementations are shown 
in table 1 . It can be seen that the implementation of this article is faster up to nine final state 
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Table 1 : Average CPU time in /js for the clustering of n particles with the cylindrical version of 
the SISCone algorithm on a standard PC. 
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Table 2: Average CPU time in /js for the clustering of n particles with the spherical version of 
the SISCone algorithm on a standard PC. 



particles. A speed-up factor of 15 is obtained for configurations with two final state particles. 
This factor is for example relevant to a forthcoming NNLO calculation of inclusive single jet 
production in pp collisions. In this calculation most subtraction terms have configurations with 
two final state particles. Let us now consider the spherical version of the SISCone jet algorithm. 
Fig. 2 shows the corresponding plot for the spherical geometry, appropriate to electron-positron 
annihilation. The absolute timings of the two implementations are shown in table 2. Here, the 
speed-up factor is almost 70 for two-particle configurations and roughly 35 for three-particle 
configurations. The implementation of this article is faster up to 10 particles. The speed-up 
factor of 35 for three-particle configurations was essential in the recent NNLO calculation of 
three-jet rates in electron-positron annihilation [9]. Without this speed improvement the CPU 
time for the jet clustering would have out-weighted the CPU time of the corresponding matrix 
elements. 

One observes that the speed-up factor is higher in the spherical version than in the cylindrical 
version. This can be understood by the following two facts: First, as explained in section 3 
the spherical version can be coded in such a way that no trigonometric function needs to be 
evaluated in the main part of the algorithm. On the other hand, the cylindrical version requires 
the evaluation of log- and atan2-functions. It is therefore expected that the implementation 
of the cylindrical version requires more CPU time compared to the spherical version. In the 
implementation of this article this is indeed the case. The second reason is rather surprising: 
It turns out that in the implementation of [3] the cylindrical version performs better than the 
spherical version. 

6 Conclusions 

In this article I reported on an implementation of the SISCone jet algorithm, which is optimised 
for low particle multiplicities. This implementation can be used for theoretical calculations in 
fixed-order perturbation theory, where the number of final state particles is rather low. The imple- 
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Performance SISCone (spherical version) 
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Figure 2: Speed-up factor as a function of the number n of final state particles for the spherical 
version of the SISCone jet algorithm. Shown is the ratio of the CPU time of the implementation 
of [3] by the CPU time of this implementation. The blue line shows a ratio of 1 . The cross-over 
occurs between 10 and 1 1 final state particles. 
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mentation of this article performs better then the reference implementation of [3] up to roughly 
nine final state particles. For a higher number of final state particles the reference implementa- 
tion of [3] should be used. For a low number of final state particles the implementation of this 
article can lead to a speed-up factor up to 70. 
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